Classification of emotional speech using spectral pattern features

نویسندگان

  • Ali Harimi Faculty of Electrical & Computer Engineering, Semnan University
  • Ali Shahzadi Faculty of Electrical & Computer Engineering, Semnan University
  • Alireza Ahmadyfard Department of Electrical Engineering and Robotics, Shahrood University of technology
چکیده مقاله:

Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram of speech signal using image processing techniques. For this purpose, details in the spectrogram image are firstly highlighted using histogram equalization technique. Then, directional filters are applied to decompose the image into 6 directional components. Finally, binary masking approach is employed to extract SPs from sub-banded images. The proposed HEs are also extracted by implementing the band pass filters on the spectrogram image. The extracted features are reduced in dimensions using a filtering feature selection algorithm based on fisher discriminant ratio. The classification accuracy of the pro-posed SER system has been evaluated using the 10-fold cross-validation technique on the Berlin database. The average recognition rate of 88.37% and 85.04% were achieved for females and males, respectively. By considering the total number of males and females samples, the overall recognition rate of 86.91% was obtained.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

classification of emotional speech using spectral pattern features

speech emotion recognition (ser) is a new and challenging research area with a wide range of applications in man-machine interactions. the aim of a ser system is to recognize human emotion by analyzing the acoustics of speech sound. in this study, we propose spectral pattern features (sps) and harmonic energy features (hes) for emotion recognition. these features extracted from the spectrogram ...

متن کامل

Features Importance Analysis for Emotional Speech Classification

The paper analyzes the prosody features, which includes the intonation, speaking rate, intensity, based on classified emotional speech. As an important feature of voice quality, voice source are also deduced for analysis. With the analysis results above, the paper creates both a CART model and a weight decay neural network model to find acoustic importance towards the emotional speech classific...

متن کامل

Emotional Features for Speech Overlaps Classification

One interesting phenomenon of natural conversation is overlapping speech. Besides causing difficulties in automatic speech processing, such overlaps carry information on the state of the overlapper: competitive overlaps (i.e. “interruptions”) can signal disagreement or the feeling of being overlooked, and cooperative overlaps (i.e. supportive interjections) can signal agreement and interest. Th...

متن کامل

Pattern Classification Using Composite Features

In this paper, we propose a new classification method using composite features, each of which consists of a number of primitive features. The covariance of two composite features contains information on statistical dependency among multiple primitive features. A new discriminant analysis (C-LDA) using the covariance of composite features is a generalization of the linear discriminant analysis (...

متن کامل

the effects of speech rate,prosodic features, and blurred speech on iranian efl learners listening comprehension

کلید واژه ها به زبان انگلیسی: effect of speech rate on listening comprehension, blurred speech,segmental and suprasegmental features,authentic speech,intelligibility, discrimination, omission, assimilation چکیده: سرعت مطالب شنیداری در کلام پیوسته بطور کلی همواره کابوسی بوده برای یادگیرنده های زبان دوم و بالاخص برای شنوندگان ایرانی. علی رغم عقل سلیم که کلام با سرعت کندتری فعالیتهای درک مطلب شن...

15 صفحه اول

Hyperspectral Images Classification by Combination of Spatial Features Based on Local Surface Fitting and Spectral Features

Hyperspectral sensors are important tools in monitoring the phenomena of the Earth due to the acquisition of a large number of spectral bands. Hyperspectral image classification is one of the most important fields of hyperspectral data processing, and so far there have been many attempts to increase its accuracy. Spatial features are important due to their ability to increase classification acc...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 2  شماره 1

صفحات  53- 61

تاریخ انتشار 2014-06-01

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023